SWEET ontology coverage for earth system sciences

Authors

  • Nicholas DiGiuseppe (University of California, Irvine, Irvine, CA 92617-3440, USA)
  • Line C. Pouchard (Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN 37831, USA)
  • Natalya Fridman Noy (Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA 94070, USA)

Communicated by: H. A. Babaie

Abstract

Scientists in the Earth and Environmental Sciences (EES) domain increasingly use ontologies to analyze and integrate their data. For example, NASA's SWEET ontologies (Semantic Web for Earth and Environmental Terminology) have become the de facto standard for representing the EES domain formally (Raskin 2010). We must now develop principled ways both to evaluate existing ontologies and to ascertain their quality quantitatively. The existing literature describes many potential quality metrics for ontologies. Among them is the coverage metric, which approximates the relevance of an ontology to a corpus (Yao et al., PLoS Comput Biol 7(1):e1001055+, 2011). This paper makes three primary contributions to the EES domain: (1) we investigate the applicability of existing coverage techniques to the EES domain; (2) we present a novel expansion of existing techniques that uses thesauri to generate equivalence and subclass axioms automatically; and (3) we present an experiment to establish an upper-bound coverage expectation for the SWEET ontologies against real-world EES corpora from DataONE (Michener et al., Ecol Inform 11:5–15, 2012) and against a corpus built from research articles to specifically match the topics covered by the SWEET ontologies. This initial evaluation suggests that the SWEET ontology can accurately represent real corpora within the EES domain.
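The abstract does not spell out how coverage is computed, so the following is only a minimal sketch under stated assumptions: coverage is taken to be the fraction of distinct corpus terms that match an ontology class label, and a thesaurus entry is treated as an equivalence between a corpus term and its synonyms (the subclass axioms mentioned above are not modeled). The function names, the toy labels, and the toy thesaurus are all hypothetical and are not drawn from SWEET or DataONE.

```python
from typing import Dict, Iterable, Optional


def normalize(term: str) -> str:
    """Lowercase a term and collapse whitespace so matching ignores case and spacing."""
    return " ".join(term.lower().split())


def coverage(
    corpus_terms: Iterable[str],
    ontology_labels: Iterable[str],
    thesaurus: Optional[Dict[str, Iterable[str]]] = None,
) -> float:
    """Fraction of distinct corpus terms matched by an ontology label.

    Assumption: a term counts as covered if it, or any thesaurus synonym
    of it, equals an ontology label after normalization.
    """
    labels = {normalize(label) for label in ontology_labels}
    synonyms = {
        normalize(key): {normalize(s) for s in values}
        for key, values in (thesaurus or {}).items()
    }
    terms = {normalize(t) for t in corpus_terms}
    if not terms:
        return 0.0
    matched = 0
    for term in terms:
        # Candidate forms: the term itself plus its (hypothetical) synonyms.
        candidates = {term} | synonyms.get(term, set())
        if candidates & labels:
            matched += 1
    return matched / len(terms)


# Toy data, invented for illustration only.
ontology_labels = ["Precipitation", "Soil Moisture", "Glacier"]
corpus_terms = ["rainfall", "soil moisture", "glacier", "stream gauge"]
toy_thesaurus = {"rainfall": ["precipitation"]}

direct = coverage(corpus_terms, ontology_labels)
expanded = coverage(corpus_terms, ontology_labels, toy_thesaurus)
print(direct, expanded)
```

In this toy case the thesaurus lets "rainfall" match the label "Precipitation", which raises the score; this is the kind of effect a thesaurus-based expansion is meant to capture, although the paper's actual procedure may differ.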


Similar resources

A Linked Science investigation: enhancing climate change data discovery with semantic technologies

Linked Science is the practice of inter-connecting scientific assets by publishing, sharing and linking scientific data and processes in end-to-end loosely coupled workflows that allow the sharing and re-use of scientific data. Much of this data does not live in the cloud or on the Web, but rather in multi-institutional data centers that provide tools and add value through quality assurance, va...


Semantic Web for Earth and Environmental Terminology

The Semantic Web for Earth and Environmental Terminology (SWEET) is a prototype for improving the discovery and use of Earth science data, through software understanding of the semantics of web resources. The semantic understanding is aided by the use of ontologies, or formal representations of technical concepts and their interrelations in a form that supports domain knowledge. The ultimate vi...


Knowledge representation in the semantic web for Earth and environmental terminology (SWEET)

In this presentation, we describe our experiences with building and using large ontologies, with application to locating NASA Earth science data. We use OWL to represent the mutual relationships of scientific concepts and their ancillary space, time, and environmental descriptors.


The Earth System Grid Discovery and Semantic Web Technologies

The Earth System Grid (ESG) is developing a virtual environment based on Grid technologies for the earth sciences and others analyzing the impacts of global climate changes. The goal of ESG is to provide discovery and secure access to very large datasets for earth sciences research. Data discovery through the use of metadata has become a major focus of ESG. Metadata schemas, a prototype ontolog...


Comparative genomics analysis in Prunoideae to identify biologically relevant polymorphisms.

Prunus is an economically important genus with a wide range of physiological and biological variability. Using the peach genome as a reference, sequencing reads from four almond accessions and one sweet cherry cultivar were used for comparative analysis of these three Prunus species. Reference mapping enabled the identification of many biological relevant polymorphisms within the individuals. E...



Journal title:
  • Earth Science Informatics

Volume 7  Issue 

Pages  -

Publication date 2014